<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd">
<html xmlns="http://www.w3.org/1999/xhtml">
<head>
<meta http-equiv="Content-Type" content="text/html; charset=UTF-8" />
<title>Berkeley Biocode, BiSciCol, BigData, Identifiers, and VertNet (3biv)</title>
<link href="../biocodecommons.css" rel="stylesheet" type="text/css" />
</head>

<body>
<h1>Berkeley Biocode, BiSciCol, BigData and Identifiers (3biv) August 13-14, 2012</h1>
<h3>Hosted at the Museum of Vertebrate Zoology  -- Berkeley, CA</h3>

<p>Sponsored by <a href="http://biscicol.blogspot.com/">BiSciCol</a> (NSF), <a href="http://vertnet.org/">VertNet</a> (NSF), and <a href="http://biocodecommons.org/">Biocode Commons</a>, in collaboration with the <a href="http://www.cdlib.org/">California Digital Library</a>.
  
  Recent discussions from the BiSciCol camp have centered around the need for a consistent, scalable approach to identifier creation for natural history collections objects and their derivatives.  Particular strategies discussed include DOIs, UUIDs, invoking namespaces to bin content (e.g. geo: or urn:catalog), generating quasi-unique identifiers (QUIDs) based on content and hashing using MD5, and working with VertNet to assign identifiers based on re-normalized data from Darwin Core Archives.  Further, we have recognized the importance of getting these identifiers back into the hands of the data providers to enable a persistent reference for future updates.  Following from the identifier discussion is adopting strategies for long-term (&gt; 10 years) storage of identifiers, properties associated with identifiers, links, enabling resolution, and publication.     
  <br />
</p>
<p>CDL has three projects of interest to this discussion: EZIDs, Merritt Repository, and DataUp.  VertNet is working closely with GBIF to develop a cloud-based publishing model for natural history collections, accessing a broad range of data of interest to BiSciCol. <a href=""https://docs.google.com/document/d/1mhqulgcaettpyt_hqvaww5rq4o4pz530vqnsg8hqwhy/edit"">The organizing google document for this workshop</a> is still available. <br />
  <br />
  
  <strong>Monday, August 13th</strong><br />
</p>
<ul>
  <li><a href="3BiTriplifierTalk.pptx" target="_blank">BiSciCol Triplifier (Deck)</a></li>
  <li>Keck Engine (Koo)</li>
  <li><a href=" http://www.slideshare.net/jakkbl/jak-3-bi">EZIDs (Kunze)</a></li>
  <li><a href="UCB-3Bi-Merritt.pptx" target="_blank">Merritt Repository (Abrams)</a></li>
  <li><a href="DataUp.pdf">DataUp (Strasser)</a></li>
  <li>VertNet (Bloom)</li>
  <li>Web-archive service (Cruz) </li>
  <li>Opportunities, synergies, challenges.  Integrating technologies and working together.</li>
  <li>VN: General Review of progress, goals, general strategy w/ regard to assembling data</li>
  <li>Synergies &amp; opportunities for collaboration between BiSciCol and VertNet</li>
</ul>
<p><br />
<strong>Tuesday, August 14th</strong></p>
<p>The hackathon goals and outputs live on the <a href="https://github.com/Bombus/pollinator">Bombus/pollinator github page</a>.  Also included was some<a href="https://github.com/mbjones/ezid"> great code provided by Matt Jones</a> at NCEAS to access EZID API which we connected to Pollinator.  John Deck, Aaron Steele, John Kunze, Rob Guralnick, and Nico Cellinese participated in hackathon component.
</p>
<p><strong>Participants</strong><br />
  Rob Guralnick, Colorado University, Boulder<br />
  Nico Cellinese, University of Florida, Gainesville <br />
  Neil Davies, Moorea/UC Berkeley<br />
  John Deck, Moorea/UC Berkeley<br />
  David Bloom, UC Berkeley <br />
  Sarah Hinman, UC Berkeley<br />
  John Kunze, CDL<br />
  Michelle Koo, MVZ/UC Berkeley<br />
  Patricia Cruse, CDL<br />
  Carly Strasser, CDL<br />
  Stephen Abrams, CDL <br />
  Chris Meyer, SI (Remote via google hangout)<br />
  Richard Pyle, Bishop Museum (Remote via google hangout)<br />
  Aaron Steele, UC Berkeley<br />
  Chris Hoffman, UC Berkeley<br />
  <br />
</p>
</body>
</html>

